<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: shijithpk</title><link>https://news.ycombinator.com/user?id=shijithpk</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 15 Apr 2026 11:35:16 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=shijithpk" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by shijithpk in "New coding models and integrations"]]></title><description><![CDATA[
<p>Here you go -- <a href="https://zhipuai.cn/en/aboutus" rel="nofollow">https://zhipuai.cn/en/aboutus</a></p>
]]></description><pubDate>Tue, 21 Oct 2025 12:05:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=45654810</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=45654810</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45654810</guid></item><item><title><![CDATA[New comment by shijithpk in "Improved Gemini 2.5 Flash and Flash-Lite"]]></title><description><![CDATA[
<p>Suppose there's a pdf with lots of tables i want to scrape. I mention the pdf url in my message and with gemini's url context tool, i now have access to the pdf.<p>I can ask gemini to give me the pdf's content as a json and it complies most of the time. But at times, there's an introductory line like "Here's your json:". Those introductory lines interfere with programmatically using the output. They're sometimes there, sometimes not.<p>If I could have structured output at the same time as tool use, I can reliably use what gemini spits out as it'll be in a json, no annoying intro lines.</p>
]]></description><pubDate>Tue, 30 Sep 2025 18:05:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=45428947</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=45428947</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45428947</guid></item><item><title><![CDATA[New comment by shijithpk in "Ingesting PDFs and why Gemini 2.0 changes everything"]]></title><description><![CDATA[
<p>Try with <a href="https://aistudio.google.com" rel="nofollow">https://aistudio.google.com</a> 
Think the page limit is a vertex thing
The only limit in reality is the number of input tokens taken to parse the pdf.
If those tokens + tokens for the rest of your prompt are under the context window limit, you're good.</p>
]]></description><pubDate>Thu, 06 Feb 2025 12:16:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=42961647</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42961647</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42961647</guid></item><item><title><![CDATA[New comment by shijithpk in "Ingesting PDFs and why Gemini 2.0 changes everything"]]></title><description><![CDATA[
<p>This is for anyone coming across this link later. In their latest SDKs, if you want to completely switch off their safety settings, the flag to use is 'OFF' and not 'BLOCK_NONE' as mentioned in the docs in the link above.<p>The Gemini docs don't refect that change yet.
<a href="https://discuss.ai.google.dev/t/safety-settings-2025-update-broken-again/59360/25" rel="nofollow">https://discuss.ai.google.dev/t/safety-settings-2025-update-...</a></p>
]]></description><pubDate>Thu, 06 Feb 2025 12:05:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=42961586</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42961586</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42961586</guid></item><item><title><![CDATA[New comment by shijithpk in "I used Google Gemini Pro to compare party manifestos for an Indian election"]]></title><description><![CDATA[
<p>Because mobile design is all about fitting in text wherever you can, right? :)</p>
]]></description><pubDate>Mon, 03 Feb 2025 18:07:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=42921013</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42921013</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42921013</guid></item><item><title><![CDATA[New comment by shijithpk in "I used Google Gemini Pro to compare party manifestos for an Indian election"]]></title><description><![CDATA[
<p>>  There is a disclaimer for this on the desktop page. But when rendered on mobile it isn't there.<p>Hmm, wonder why this guy chose to do it this way. Was it a design decision? Was it because of the tight space? We will never know.</p>
]]></description><pubDate>Mon, 03 Feb 2025 16:40:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=42920048</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42920048</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42920048</guid></item><item><title><![CDATA[New comment by shijithpk in "I used Google Gemini Pro to compare party manifestos for an Indian election"]]></title><description><![CDATA[
<p>i am writing about this on hacker news, have posted a twitter thread about it and also put up my code on Github. Nothing is being hidden here mate, dont know how much more transparent I can get :)</p>
]]></description><pubDate>Mon, 03 Feb 2025 14:25:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=42918490</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42918490</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42918490</guid></item><item><title><![CDATA[New comment by shijithpk in "I used Google Gemini Pro to compare party manifestos for an Indian election"]]></title><description><![CDATA[
<p>Hi, my name is Shijith and I'm a data journalist from India.<p>So we have a state election in Delhi this week (Feb 5), and I wanted to make a guide to help residents decide who to vote for.<p>Now parties usually declare their political platforms, their programs etc. in manifestos released before the election.<p>These manifestos are 20-30 pages long sometimes, and can be a slog to get through.<p>And if there is a particular issue you're worried about, say pollution, it's difficult to compare them all at once.<p>I decided to bring AI into the picture to compare the manifestos on 10 key issues like pollution and eduction.<p>Specifically, I used Google's AI model Gemini 1.5 Pro to scrape the various pdfs and generate the text for the comparison.<p>The github repo contains the python code I used to interact with the API.<p>You can see the guide I created at <a href="https://data.indianexpress.com/projects/manifesto-guide" rel="nofollow">https://data.indianexpress.com/projects/manifesto-guide</a><p>I also wrote a long twitter thread that goes into more detail on how this was made: <a href="https://x.com/shijith/status/1879145563234648564" rel="nofollow">https://x.com/shijith/status/1879145563234648564</a><p>If you have any questions or feedback, do let me know. Thanks! -shijith</p>
]]></description><pubDate>Mon, 03 Feb 2025 14:05:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=42918333</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42918333</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42918333</guid></item><item><title><![CDATA[I used Google Gemini Pro to compare party manifestos for an Indian election]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/shijithpk/manifesto_work_2025_delhi">https://github.com/shijithpk/manifesto_work_2025_delhi</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42918328">https://news.ycombinator.com/item?id=42918328</a></p>
<p>Points: 3</p>
<p># Comments: 8</p>
]]></description><pubDate>Mon, 03 Feb 2025 14:05:24 +0000</pubDate><link>https://github.com/shijithpk/manifesto_work_2025_delhi</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=42918328</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42918328</guid></item><item><title><![CDATA[New comment by shijithpk in "[dead]"]]></title><description><![CDATA[
<p>I'm a data journalist and I live in a village in Kerala, a state in southern India.<p>We're getting into summer here, so I wanted to see how hot villages in Kerala get in general, and how the temperatures now compare to 10 or so years ago.<p>Made use of satellite data for my analysis, mainly land surface temperature data from MODIS.<p>Was thinking of using Google Earth Engine for this story, but decided to go with Planetary Computer since it's Python-centric and I'm more familiar with that language. (GEE is more javascript-oriented.)<p>Pretty sure earth/geo/environment science guys won't be happy with my methodology, but hopefully there's enough of a logic to it for the story to be acceptable as a basic analysis.<p>You can see my code at the github repo (<a href="https://github.com/shijithpk/hottest-panchayats-kerala">https://github.com/shijithpk/hottest-panchayats-kerala</a>)<p>Have any suggestions or feedback, let me know, thanks!</p>
]]></description><pubDate>Fri, 24 Feb 2023 07:49:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=34922068</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=34922068</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34922068</guid></item><item><title><![CDATA[New comment by shijithpk in "[dead]"]]></title><description><![CDATA[
<p>Hi, my name is Shijith and I'm a freelance data journalist from India.<p>Just wanted to plug my new story on streaming services in India and how well they cover western music.<p>I specifically look at much access they give to critically-acclaimed albums from the past and present.<p>I used features like 'Best Albums Ever', 'Best albums of 2021' and so on from top music publications and websites to come up with a list of albums in each genre (Rock, EDM etc.) these services should have.<p>Services were rated out of 10 in each genre, with the rating corresponding to what percent of the album list they have in their library. Guess it won't be a surprise for people in India that Apple or Spotify came first in most lists.<p>In the overall rating that looks at performance across all genres, the top four spots were taken by global services — Spotify (9.3/10), Apple Music (9.3), YouTube Music (8.9) and Amazon Music (8.5). Of services based in India, only JioSaavn (8.1) came anywhere close.<p>There's all this and more in my blog post at <a href="https://shijith.com/blog/music-streaming-india/" rel="nofollow">https://shijith.com/blog/music-streaming-india/</a><p>And if you're interested in my code it's here:  <a href="https://github.com/shijithpk/music-streaming-india" rel="nofollow">https://github.com/shijithpk/music-streaming-india</a><p>If you have any questions or feedback, do let me know. Thanks! -shijith</p>
]]></description><pubDate>Sun, 11 Sep 2022 07:02:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=32797447</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=32797447</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32797447</guid></item><item><title><![CDATA[New comment by shijithpk in "Which Wikipedia pages in India are abused the most?"]]></title><description><![CDATA[
<p>Hi, my name is Shijith, and I'm a freelance data journalist from India (Worked previously at Hindustan Times and IndiaSpend).<p>Just posting a data story I did recently about wikipedia abuse in India. Such abuse is an old problem, but it's getting more media attention with users distorting facts on pages about the Delhi riots or farmer protests. Sometimes users engage in straight out vandalism where they delete whole sections from a page.<p>I tried to determine which wikipedia pages faced the most abuse this year, and also introduce a twitter account that allows people to track wikipedia abuse weekly.<p>This is the twitter account for tracking wikipedia abuse every week: <a href="http://twitter.com/abuse_checker" rel="nofollow">http://twitter.com/abuse_checker</a><p>And here's the python code I used for the project: <a href="https://github.com/shijithpk/wikipedia_abuse_checker" rel="nofollow">https://github.com/shijithpk/wikipedia_abuse_checker</a><p>(Am in the process of re-working the code. Right now it's querying the wikipedia API every week for edit histories of over 150k articles, and the whole run is taking 2 days now. Discovered an API endpoint for recent changes that should make things more efficient.)<p>Have any questions or feedback, do let me know below!</p>
]]></description><pubDate>Tue, 21 Dec 2021 09:13:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=29635506</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=29635506</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29635506</guid></item><item><title><![CDATA[Which Wikipedia pages in India are abused the most?]]></title><description><![CDATA[
<p>Article URL: <a href="https://shijith.com/blog/wikipedia-page-abuse/">https://shijith.com/blog/wikipedia-page-abuse/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=29635493">https://news.ycombinator.com/item?id=29635493</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 21 Dec 2021 09:11:28 +0000</pubDate><link>https://shijith.com/blog/wikipedia-page-abuse/</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=29635493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29635493</guid></item><item><title><![CDATA[New comment by shijithpk in "Hide like counts on Twitter and karma points on Reddit with uBlock Origin"]]></title><description><![CDATA[
<p>Just submitting a link here for visibility, hopefully it'll come up in someone's google search later on and help them. The blogpost is about a list of filters I came up with to try and make websites like Twitter and Reddit less stressful. (List of filters at <a href="https://github.com/shijithpk/hide_like_counts_with_ublock" rel="nofollow">https://github.com/shijithpk/hide_like_counts_with_ublock</a> )<p>You copy the filters into the adblocker uBlock Origin, after which all elements on their webpages that show any stats, eg. like counts, retweet counts, karma points, upvotes etc. get hidden.<p>This is mainly for desktop users. You could possibly use these filters in adblockers other than uBlock Origin, not tested it though.<p>Hopefully, Twitter and Reddit will introduce settings like Instagram/Facebook have to hide like counts, but till that happens, this filter list is a handy option.</p>
]]></description><pubDate>Fri, 30 Jul 2021 03:25:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=28004103</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=28004103</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28004103</guid></item><item><title><![CDATA[Hide like counts on Twitter and karma points on Reddit with uBlock Origin]]></title><description><![CDATA[
<p>Article URL: <a href="https://shijith.com/blog/hiding-like-upvote-counts/">https://shijith.com/blog/hiding-like-upvote-counts/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=28004029">https://news.ycombinator.com/item?id=28004029</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 30 Jul 2021 03:09:52 +0000</pubDate><link>https://shijith.com/blog/hiding-like-upvote-counts/</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=28004029</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28004029</guid></item><item><title><![CDATA[New comment by shijithpk in "Show HN: A Python package that makes it easier to work with lists on Twitter"]]></title><description><![CDATA[
<p>twitter_list_mgmt is a python package I created to make it easier to add users to your Twitter list from other lists, among other things.<p>Say you've created a covid twitter list to keep track of news around the pandemic. You've just found another list on covid curated by an epidemiologist in London, and you want to add members from that to your own Covid list. This is the package you use for it.<p>Now for most basic operations like retrieving the current membership of a Twitter list, adding users to it, removing them etc. the Tweepy library is good enough. twitter_list_mgmt just adds extra functionality on top of Tweepy to make working with lists easier.<p>This package will help heavy twitter and tweetdeck users, especially those who use lists to manage the firehose of information from social media.<p>Have any questions or comments, do let me know! Also, not a professional developer/programmer, so be nice :)</p>
]]></description><pubDate>Mon, 12 Jul 2021 06:46:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=27807758</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=27807758</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27807758</guid></item><item><title><![CDATA[Show HN: A Python package that makes it easier to work with lists on Twitter]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/shijithpk/twitter_list_mgmt">https://github.com/shijithpk/twitter_list_mgmt</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=27807729">https://news.ycombinator.com/item?id=27807729</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 12 Jul 2021 06:42:18 +0000</pubDate><link>https://github.com/shijithpk/twitter_list_mgmt</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=27807729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27807729</guid></item><item><title><![CDATA[New comment by shijithpk in "Show HN: Find new music on Spotify by aggregating the choices of tastemakers"]]></title><description><![CDATA[
<p>I'm an amateur python coder, and this is a basic script that makes it easier to find new music. It does this by essentially combining playlists from influential music websites and radio stations into a single one on Spotify. There are around 75 sources you can choose from, mainly from the US and UK, but you can also add other sources. It creates a playlist for you in your Spotify library, and makes sure that only the newest tracks get added to it every week. Have written about my motivations for coding this at <a href="https://shijith.com/blog/automating-music-discovery/" rel="nofollow">https://shijith.com/blog/automating-music-discovery/</a></p>
]]></description><pubDate>Tue, 22 Jun 2021 07:17:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=27588872</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=27588872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27588872</guid></item><item><title><![CDATA[Show HN: Find new music on Spotify by aggregating the choices of tastemakers]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/shijithpk/music-discovery">https://github.com/shijithpk/music-discovery</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=27588770">https://news.ycombinator.com/item?id=27588770</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 22 Jun 2021 06:57:37 +0000</pubDate><link>https://github.com/shijithpk/music-discovery</link><dc:creator>shijithpk</dc:creator><comments>https://news.ycombinator.com/item?id=27588770</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27588770</guid></item></channel></rss>